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NEURAL CELL ADHESION MOLECULE SPLICING VARIANTS 

HELD OF INVENTION 

5 This invention relates to newly identified splice- variant polynucleotides, 

polypeptides encoded by them and to the use of such polynucleotides and polypeptides, and 
to their production. More particularly, the polynucleotides and polypeptides of the pr^nt 
invention relate to the cell adhesion molecule family, hereinafter referred to as NrCAMvar. 
Hie invention also relates to inhibiting or activating the action of such polynucleotides and 

10 polypeptides. 

BACKGROUND OF THE INVENTION 

The NgCAM-related cell adhesion molecule NrCAM, also called bravo, was first 
identified and characterised in chick by Grumet et fli.,(I991). Although the sequence of 
15 rat NrCAM is not published, it has been cloned and sequenced (Davis and Bennett, 1994). 
This cell siuf ace glycoprotein is a member of the immunoglobulin Qg) superfamily, and is 
very similar in structure to chick NgCAM, human and mouse LI and chick neurofascin. 
Each consists of six Ig domains, five fibronectin type Ill-like repeats, a transmembrane 
domain and an intracellular region. These neural cell surface proteins play a critical role 
20 in nervous system development Studies from Bennett et aL, (Bennett and Gilligan, 1993; 
Davis and Bennett, 1994) suggested that these molecules, including chick and rat NiCAM, 
have ankyrin binding activity suggesting that they may be important in membrane- 
cytoskeletal connections in brain. A role for NrCAM in the in vivo guidance of chick 
commissural neurones has been idenUfied and distinguished from that of NgCAM (Stoeckli 
25 and Landmesser, 1995). Chick NrCAM in floor plate cells together vn\h axonin-1 on 
commissural growth cones is essential for accurate pathfinding at the midline whereas 
NgCAM is required for fascicuiation of the commissural neurites. As well as interacting 
with axonin- 1 , NrCAM can also bind at the cell surface with Fl 1 , another member of the 
Ig superfamily (Morales^r oi., 1993). 
30 Recently a highly conserved human homologue to chick NrCAM was described 

(Lane et aU, Genomics 35 (3). 456-465 (1996)) with 82% amino acid identity to the chick 
protein. The transmembrane arul intracellular domains of human NrCAM are 100% 
identical to the chick homologue while percent identities for individual extracellular 
domains vary from 66% for IgVI to 93% for IglV. Lane et at . identified two alternatively 
35 spliced exons, AE12 encoding a 12-amino-acid section 5' to FNIII-5, and AE93 encoding 
the 93-amino-acids coaesponding to the whole of FNIII-5 (Figure 2). Four different 
isoforms were found: with both AE12 and AE93. with only AE12 or AE93, and without 
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either AE12 or AE93. In addition to AE12 and AE93, two more splice variants have been 
identified in chick. AE19 and AEIO. AE19 encodes a 19-anuno-acid section between IgU 
and Iglll while AEIO is a 10 amino-acid section between IgVI and FNIII-1 (Gnimet et oL, 
1991). Using human NiCAM probes, Lane et al, observed one major RNA band of 
5 -7.0kb in multiple brain tissues including amygdala, caudate nucleus, corpus callosum, 
hippocampus, hypothalamus, substantia nigra, subthalamic nucleus, and thalamus. In 
chick, the same size of RNA was found in brain tissue but not in embryo heart, gizzard oc 
liver on Northern blots. 

This indicates that these cell adhesion molecules have an established importance in 
10 vertebrate devdc^ihem and are consequentiy candicbtes for tteapaitic targ^^ Clearly there is 
a need for identification and characterization of further members and variants, including splice 
variants, of the cell adhesion molecule family which can play a role in preventing, ameliorating 
or correcting dysfunctions a: diseases. 

15 SUMMARY OF THE INVENTION 

In one aspect, the invention relates to NrC AMvar polypeptides and recombinant 
materials and metiiods for their productioa AiK3ther aspea of the invention relates to methods 
for using such NrCAMvar polypeptides and polynucleotides. Such uses include the treatn^nt 
of diabetes, obesity and career, among otiiers. In still another aspect, the invention relates to 

20 methods to identify agonists and antagonists using the materials provided by the invention, 
and treating conditions associated with NrCAMvar imbalance with the identified compounds, 
including diabetes, obesity and cancer. Yet another aspect of the invention relates to diagnostic 
assays for detecting diseases associated with inappropriate NrCAMvar activity or levels, 
including diabetes, obesity and cancer. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the nucleotide and deduced amino acid sequence of a human 
NrCAMvar; SEQ ID NOS: 1 and 2, respectively. 

Figure 2 shows a comparison of the sequences of human NrCAMvar of the present 
30 invention and human and chick NrCAM cDNAs, 



DESCRIPTION OF THE INVENTION 
Definitions 

The following definitions are provided to facilitate understanding of certain terms 
35 used frequentiy herein. 



2 
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"NrCAMvar" refers, among others, generally to a polypeptide having the amino 
acid sequence set forth in SEQ ID NO:2 or an allelic variant thereof. 

"NrCAMvar activity or NrCAMvar polypeptide activity" or ''biological activity of 
the NrCAMvar or NrCAMvar polypeptide" refers to the metabolic or physiological 

S function of said NrCAMvar including similar activities or improved activities or these 
activities with decreased undesirable side-effects. Also included are antigenic and 
inunimogenic activities of said NrCAMvar. 

"NrCAMvar gene" refers to a polynucleotide having the nucleotide sequence set 
forth in SEQ ID NO; 1 or allelic variants thereof and/or their complements. 

10 "Antibodies" as used herein includes polyclonal and monoclonal antibodies. 

chimeric, single chain, and humanized antibodies, as well as Fab fragments, including the 
products of an Fab or otiier immunoglobulin expression library. 

"Isolated" means altered "by the hand of man" from the natural state. If an 
"isolated" composition or substance occurs in nature, it has been changed oc removed from 

IS its original envirorunent, or both. Fc^ example, a polynucleotide or a polypeptide naturally 
present in a living animal is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is "isolated", as the term is 
employed hereia 

'Tolynucleotide" generally refers to any polyribonucleotide or 

20 polydeoxribonudeotide, which may be uiunodiGed RNA or DNA or modified RNA ch* 
DNA, "Polynucleotides" include, without limitation single- and double-stranded DNA, 
DNA that is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules 
comprising DNA and RNA that may be single-stranded or, more typically, double- 

25 stranded or a mixture of single- and double-suanded regions. In addition, "polynucleotide" 
refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The 
term polynucleotide also includes DNAs or RNAs containing one or more modified bases 
and DNAs or RNAs with backbones modified for stability or for other reasons. 
"Modified" bases include, for example, tritylated bases and unusual bases such as inosine. 

30 A variety of modifications have been made to DNA and RNA; thus, "polynucleotide" 

embraces chemically, enzymatically or metabolically modified forms of polynucleotides as 
typically found in nature, as well as the chemical forms of DNA and RNA characteristic of 
viruses and cells. "Polynucleotide" also embraces relatively short polynucleotides, often 
referred to as oligonucleotides. 

35 "Polypeptide" refers to any peptide or protein comprising two or more amino adds 

joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. 
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'Tolypeptide" refers to both short chains, commonly referred to as peptides, oligopeptides 
or oligomers, and to longer chains, generally referred to as proteins. Polypeptides may 
contain amino acids other than the 20 gene-encoded amino acids. "Polypeptides" include 
amino acid sequences modified either by natural processes, such as posttranslational 
S processing, or by chemical modification techniques which are well known in the ait Such 
modifications are well described in basic texts and in more detailed monographs, as weU as 
in a voluminous research literature. Modifications can occur anywhere in a polypeptide, 
including the peptide backbone, the amino acid side-chains and the amino or carboxyl 
termini. It will be appreciated that the same type of modification may be present in the 

10 same or varying degrees at several sites in a given polypeptide. Also, a given polypeptide 
may contain many types of modifications. Polypeptides may be branched as a result of 
ubiquitinaiion, and they may be cyclic, with or without branching. Cyclic, branctied and 
branched cyclic polypeptides may result from postiranslation natural processes or may be 
made by synthetic methods. Modifications include acetylation, acylation, ADP- 

1 5 ribosylation. amidation, covalent attachment of flavin, covalent attachment of a heme 

moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment 
of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, 
formation of cystine, formation of pyroglutamate, formylation, gamma-caiboxylation, 

20 glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 
racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton,W. H. 

25 Freeman and Company, New York, 1993 and Wold, F., Posttranslational Protein 

Modifications: Perspectives and Prospects, pgs. 1-12 in POSTTRANSLATIONAL 
COVALENT MODIHCATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, 
New York, 1983; Seifter et al,, "Analysis for protein modifications and nonprotein 
cofactors", Meth Enzymol (1990) 182:626-646 and Rattan et aL, *Trotein Synthesis: 

30 Posttranslational Modifications and Aging", Ann NY Acad Sci (1992) 663:48-62. 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that differs 
from a reference polynucleotide or polypeptide respectively, but retains essential 
properties. A typical variant of a polynucleotide differs in nucleotide sequence from 
anotiier, reference polynucleotide. Changes in the nucleotide sequence of the variant may 

35 or may not alter the amino acid sequence of a polype ptide encoded by the reference 

polynucleotide. Nucleotide changes may result in amino acid substitutions, additions, 

4 
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deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as 
discussed below. A typical variant of a polypeptide differs in amino acid sequence firom 
another, reference polypeptide. Generally, differences are liniited so thai the sequences of 
the reference polypeptide and the variant are closely similar overall and, in many regions, 

5 identical. A variant and reference polypeptide may differ in amino acid sequence by one or 
more substitutions, additions, deletions in any combination. A substituted oc inserted 
amino acid residue may or may not be one encoded by the genetic code. A variant of a 
polynucleotide or polypeptide may be a naturally occurring such as an allelic variant, or it 
may be a variant that is not known to occur naturally. Non-naturaDy occurring variants of 

10 polynucleotides and polypeptides may be made by mutagenesis techniques or by direa 
synthesis. 

"Identity" is a measure of the identity of nucleotide sequences or amino acid 
sequences. In general, the sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated using 

15 pubUshed techniques. See, e.g.: (COMPUTATIONAL MOLECULAR BIOLOGY, Lesk. 
A.M., ed., Oxford University Press, New York, 1988; BIOCOMPUTING: 
INFORMATICS AND GENOME PROJECTS, Smith, D.W., ed,. Academic Press, New 
Yoric 1993; COMPUTER ANALYSIS OF SEQUENCE DATA, PART I. Griffin, A,M., 
and Griffin, H.G., eds., Humana Press, New Jersey, 1994; SEQUENCE ANALYSIS IN 

20 MOLECULAR BIOLOGY, von Heinje, G., Academic Press, 1987; and SEQUENCE 
ANALYSIS PRIMER, Gribskov, M. and Devereux, J., eds., M Stockton Press, New 
York, 1991). While there exist a number of methods to measure identity between two 
polynucleotide or polypeptide sequences, the term "identity" is well known to skilled 
artisans (Carilio, H., and Lipton, D., SIAM J Applied Math (1988) 48: 1073). Methods 

25 commonly employed to determine identity or similarity between two sequences include, but 
are not limited to, those disclosed in Guide to Huge Computers, Martin J. Bishop, ed.. 
Academic Press, San Diego, 1994, and Carilio, H., and Lipton, D.. SIAM J Applied Mash 
(1988) 48:1073. Methods to determine identity and similarity are codified in computer 
programs. Preferred computer program methods to deterniine identity and similarity 

30 between two sequences include, but are not limited to, GCS program package (Devereux, 
J., et al.. Nucleic Acids Research (1984) I2(l):387), BLASTP, BLASTN, FASTA 
(Atschul, S.F. et aL.JMolec Biol (1990) 215:403). 



35 



The invention discloses a new splice variant of NrC AM (termed NrCAMvar) which 
comprises the AE10K2 sequence, which is absent in the published human NrC AM sequence 
(Lane oil, 1996) but which is present in tl^ chick sequence. In addition the NrCAMvar does 
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not have the AEIOKI sequence which is present in the Lane et aL human sequence (Figure 2). 
NrC AMvar is expressed at high levels in the brain, pancreas and adrenal cortex and at lower 
levels in placenta, adrenal medulla, thyroid and testis. published human NrC AM, 
however, appears not to be expressed in the pancreas, 

5 

Polypeptides of the Invention 

In one aspect, the present invention relates to novel NrCAMvar polypeptides. Tte 
NrCAMvar polypeptides include the polypeptide of SEQ ID NO:2; as weU as polypeptides 
comprising the amino acid sequence of SEQ ID NO: 2. Preferably NrCAMvar polypeptide 

1 0 exhibit at least one biological activity of NrCAMvar. 

The NrCAMvar polypeptides may be in the form of the "mature" protein or may 
be a part of a larger protein such as a fusion protein. It is often advantageous to include 
an additional amino acid sequence which contains secretcxy or leader sequences, pro- 
sequences, sequences which aid in purification such as multiple histidine residues, or an 

15 additional sequence for stability during recombinant productioa 

Bidpgically active fragments of the NrCAMvar polypeptides are also included in the 
invention. A fragment is a polypeptide having an amino add sequemx that entirely is the same 
as part, but not all, of the amino acid sequence of the afOTcmentioned NrCAMvar polypeptides. 

As with NrCAMvar polypeptides, fragments may be "free-standing," or comprised within a 

20 larger polypeptide of wWch they form a part ot region, most preferably as a single continuous 
regioa 

Preferred fragments include, for example, truncation polypeptides having the amino 
add sequence of NrCAMvar polypeptides, except for ddetion of a continuous series of 
residues that includes the amirx) terminus, cr a continuous series of residues that iix:iudes the 

25 carboxyl terminus or deletion of two continuous series of residues, or^ inr^iding the amino 
terminus and or^ induding the carboxyl terminus. Also preferred are fragnoents charactaized 
by structural or functional attributes such as fragments that comprise alpha-hdix arKl alj^- 
Mix forming regions, beta-sheet and beta-sheet-forming regions, turn and tum-foTning regions, 
coil and coil-forming regions, hydrophilic regions, hydrophobic regions, alpha an^hipatiiic 

30 regions, beta amphipatiiic regions, flexible regions, surface-forming regions, substrate binding 
region, and high antigenic index regions. Biologically active fragments are those that mediate 
NrCAMvar activity, including tiiose with a similar activity or an improved activity, or with a 
decreased uiKlesirable activity. Also included are tixase that are antigenic or imnumogenic in an 
animal, espedally in a humaa 

35 Preferabl y, all of tiiese polypeptide fragments r^in the Mnlngicni activity of the 

NrCAMvar, induding antigenic activity. Variants of tiie (fefined sequence and fragments also 

6 
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fain part of the present inventioa Preferred variants are those that vary frcm the referents by 
conservative amino acid substitutions - i.e., those that substitute a residue with aiKXher of like 
characteristics. Typical such substitutions are among Ala, VaU Leu and He; among Ser and 
Ttm among the acidic residues Asp and Giu; anxDng Asn and Gin; and among the basic residues 
5 Lys and Arg; aromatic residues Phe and Tyr. 

The NiC AMvar polypeptides of the invention can be prqjared in any suitable manner. 
Such pdypq5tides include iscdated naturally occurring polypeptides, recombinantly produced 
polypq)tides, synthetically produced polypeptides, or polypeptides produced by a oombinatioa 
of these mediods. Means for preparing such polypeptides are well understood in the art 

10 

Polynucleotides of the Invention 

Another aspea of the invention relates to NrC AMvar polynudeoiides. NrCAMvar 
polynucleotides include isolated polynucleoddes which encode the NrCAMvar polypeptides and 
fragments, and polynucleotides closdy related thereto. More specifically, NrCAMvar 
IS polynucleotide of the invention include a polynucleotide comprising the nudeodde sequence set 
facth in SEQ ID NO: I encoding a NrCAMvar polypeptide of SEQ ID NO: 2, and 
polynucleotides having the particular sequence of SEQ ID NO: I . Also included under 
NrCAMvar polynucleotides are a nucleotide sequence which has sufficient identity to a 
nucleotide sequence contained in SEQ ID NO:l to hybridize under conditions useable for 
20 amplification or for use as a probe or marker. The invention also provides polynucleotides 
which are complementary to such NrCAMvar polynucleotides. 

NrCAMvar of the invention is structurally related to other (xoteins of the cell adhesion 
nx)lecules, as shown by die results of sequencing the cDN A encoding human NrCAMvar, The 
cDNA sequence contains an open reading frame encoding a pdlypq)tide of 1304 amiix> adds. 
25 Amino add of sequence of Figure 1 (SEQ ID N0:2) has about >99% identity (using BlastP) in 
1299 amino add residues with Human NrC AM (Lane. RP et al. Genomics 35 (3), 456-465 
(1996)). Nudeotide sequence of Figure 1 (SEQ ID NO:l) has about >99% identity (using 
BlastN) in 3897 nucleotide residues witii Human NrC AM (Genomics 35 (3). 456-465 (1996)). 
Figure 2 shows the splice variant AEl OK. 
30 One polynucleotide of the present invention encoding NrCAMvar may be obtained 

using standard doning and screening, from a cDN A library derived from mRNA in cells of 
human adrenal using the expressed sequence tag (EST) analysis (Adams, M.D., et al 
Science (1991) 252:1651-1656; Adams, M.D. et aL Nature, (1992) 555:632-634; 
Adams, M.D., et al. Nature (1995) 377 Supp:3-174). Polynucleotides of the invention 
35 can also be obtained from natural sources such as genomic DNA libraries or can be 
synthesized using well known and conunercially available techniques. 
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The nucleotide sequence encoding NrCAMvar polypeptide of SEQ ID NO:2 may 
be identical over its entire length to the coding sequence set forth in Figure 1 (SEQ ID 
NO: 1), or may be a degenerate form of this nucleotide sequence encoding the polypeptide 
of SEQ ID N0:2, or may be highly identical to a nucleotide sequence that encodes the 
5 polypeptide of SEQ ID NO: 1 

When the polynucleotides of the invention are used for the recombinant production 
of NiC AMvar polypeptide, tiie polynucleotide may include tiie coding sequence for the 
mature polypeptide or a fragment thereof, by itself; the coding sequence for the mature 
polypeptide or fragment in reading frame with other coding sequences, such as those encoding a 
1 0 leader or secretory sequence, a pre-, cr pro- ot prepro- protein sequence, or other fusion peptide 
pcxtions. For example, a marker sequence which facilitates purification of tiie fused polypeptide 
_can be encoded. _In certain preferred embodiments of this aspect of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in Uie pQE vector (Qiagen, Inc.) and described 
in Gensz etaL. Proc Nail Acad Sci USA (1989) 86:821-824, or is an HA tag. The 
15 polynudeotide may also contain non<aling 5' and 3' sequences, such as transcri^ 

translated sequences, splicing and polyadenylation signals, ribosome binding sites and sequences 
that stabilize mRN A. 

Further preferred embodiments are polynucleotides encoding NfCAMvar variants 
comprise the amino add sequence NrCAMvar polypeptide of Figure 1 (SEQ ID NO:2) in 
20 whidi several, 5-10, 1-5, 1-3, 1-2 a* 1 amino add residues are substituted, ddeted or added, in 
any combinatioa 

The present invention further relates to polynucleotides that hybridize to the herein 
above-described sequences. In this regard, tiie present invention especially relates to 
polynudeoddes which hybridize under stringent conditions to the herein above-described 

25 polynucleotides. As herdn used, tiie term "strlngem conditioiis" n^ans hybridization wifl oc^ 
only if tfiere is at least 95% and preferably at least 97% identity t>etween tiie sequences. 

Polynucleotides of tiie invention, which are identical cr suffidentiy identical to a 
nucleotide sequence contained in SEQ ID N0:1, may be used as hybridization probes for 
cDN A and genc»nic DNA, to isolate fuU-lengtii cDNAs and genomic denies encoding 

30 NrCAMvar polypeptide and to isolate cDN A and genomic dones of other genes tiiat have a 
high sequence similarity to ttie NrCAMvar gene. Such hybridization techniques are known to 
ttiose of skiU in tiie art Typically tiiese nucleotide sequences are 70% identical, preferably 80% 
identical more preferably 90% identical to tiiat of tiie referent The probes generally win 
comprise at least 15 nucleotides. Preferably, such probes will have at least 30 nucleotides and 

35 may have at least 50 nucleotides. Particularly preferred probes wiU range between 30 and 50 
nudeotides. 



8 
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In one embodiment, to dbtom a polynucleoticJe encoding NrC AMvar comprises the 
steps of screening an appropriate library under stingent hybridization conditiwis with a labeled 
probe having the SEQ ID NO: I or a fragment thereof, and isolating full-length cDN A and 
genomic clones ccMitaining said polynucleotide sequence. Such hybridization tediniques arc well 
5 known to those of skin in the art. Stringent hybridization conditions are as defined above cr 
alternatively conditions under overnight incubation at 42% in a solution comprising: 50% 
formamide, 5xSSC (150mM NaQ, 15mM trisodium citrate), 50 mM sodium phosptote 
(pH7.6), 5x Denhardt's sdutioa 10 % dextran sulfate, and 20 micrpgram/ml denatured, ^saied 
salmon sperm DNA, followed by v^rashing the filters in 0. Ix SSC at about 65%. 
10 The pc^ynucleotides and polypeptides of the present invention may be enq>loyed as 

research reagents and materials ftx* discovery of treatments and diagnostics to animal and 
human disease. 

Vectors, Host Cells, Expression 

15 The present invention also relates to vectors whidi ccanprise a polynucleotide or 

polynucleotides of the present invention, and host cells which are gen^cally engineered with 
vectors of the invention and to the production of polypeptides of the inventiCHi by recombinant 
techniques. Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. 

20 For recanbinant production, host cells can be genetically engineered to incorporate 

expression systems or portions thereof for polynucleotides of the present invention. Introduction 
of polynucleotides into host cells can be effeaed by methods described in many standard 
labOTatCKy manuals, such as Davis et aL , BASIC METHODS IN MOLECULAR BIOLOGY 
(1986) and Sambrook et al„ MOLECULAR CLONING: A LABORATORY MANUAL, 2nd 

25 Ed., Cold Spring Harbor LabOTatCHy Press. Cold Spring Harbor, N.Y. (1989) such as calcium 
phosphate transfectiwi, DEAE-dextran mediated transfection, transvecticai, microinjection, 
cationic lipid-mediated transfection, dectroporation, transduction, scrape loading, ballistic 
introduction or infectioa 

Representative examples of appropriate hosts include bacterial cells, such as 

30 streptococci, staphylococci, £1 coU, Streptomyces and Bacillus subtilis cells; fungal cells, sudi 
as yeast cells :sxtil Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 
cells; animal cells such as CHO, COS. HeLa, C127, 3T3, BHK, 293 and Bowes mdanoraa 
cells; and plant cells. 

A great variety of expression systems can be used Such systems include, among 

35 others, chromosomal, episomal and virus-derived systems, e.g., vectors derived from bacterial 
plasmids, from bacteriophage, from transposons, from yeast episraies, from insertion elements, 



wo 98/36062 



PCT/GB98/00434 



from yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as 
S V40, vaccinia viruses, adenoviruses, fov^ pox viruses, pseudorabies viruses aixl retraviniseSt 
and vectors derived from combinations thereof, such as those derived from plasmid and 
bacteriophage genetic dements, sudi as cosmids and phagemids. The expression systems may 
5 contain control regions that regulate as weD as engender expression. Generally, any system or 
vecta* suitable to maintain, propagate or exjress polynucleotides to produce a pcdypeptide in a 
host may be used. The apprcpriate nucleotide sequence may be inserted into an e^qjression 
system by any of a variety of well-known and routine techniques, such as, for example, those set 
fOTth in Sambrook et al,, MOLECUIAR CLONING, A LABORATORY MANUAL {supra). 

1 0 For seaetion of the translated protein into the lumen of the endoplasmic reticulum, into 

the periplasmic space or into the extracellular environment, apprqjriate secretion signals may 
be incorporated into the desired polypeptide. These signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

If the NiCAMvar polypeptide is to be expressed for use in screening assays, generally, 

15 it is preferred that the polypeptide be produced at the surface of the ceil. In this event, the 
cells may be harvested prior to use in the screening assay. If NrCAMvar polypeptide is 
secreted into the medium, the medium can be recovered in order to recover and purify the 
polypeptide; if produced intracellularly, the cells must first be lysed before the polypeptide 
is recovered. 

20 NrCAMvar polypeptides can be recovered and purified ftom recombinant cell cultures by weD- 
known methods including ammonium sulfate or ethanol precipitation, add extraction, anion or 
cation exchange diromatography. phosphoceUulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chrcxnatography and lectin 
chromatography. Most preferably, high performance liquid chromatography is employed for 

25 purificatioiL WeD known techniques for refolding prc^eins may be employed to regenerate 
active conformation when the polypqjtide is denatured during isolation and or purificatiOT. 

Diagnostic Assays 

Tlus invention also relates to the use of NiCAMvar polynucleotides for use as 
30 diagnostic reagents. Detection of a mutated form of NrCAMvar gene associated with a 

dysfunction will provide a diagnostic tool that can add to or define a diagnosis of a (fisease or 
susceptibility to a disease which results from under-expressira, over-expression or altered 
expression of NrCAMvar. Individuals carrying mutations in the NiCAMvar gene may be 
detected at the DN A levd by a variety of techniques. 

„35 Nucleic acids fey d iagnosis may he nhtaineri from a subject's ceOs, such as 

from blood, urine, saliva, tissue biopsy ca* autopsy material The genomic DNA may be used 

10 
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directly for cietection or may be amplified enzymatically by using PCR or cxher an^)lificatiCBi 
techniques prior to analysis. RN A or cDNA may also be used in similar fashion. Deleticffis and 
insertions can be detected by a change in size of the amplified product in comparison to the 
normal genc^ype. Point mutations can be identified by hybridizing an^lified DNA to labeled 

5 NrCAMvar nucleotide sequences. Perfectly matched sequences can be distinguished frcm 
mismatched duplexes by RNase digestion or by differences in melting temperatures. DNA 
sequence differences may also be detected by alterations in electrq)hor^c mobility of DNA 
firagmems in gels, with a: without denaturing agents, or by direa DNA sequencing. See, e.g., 
Myers et al , Science (1985) 230: 1 242. Sequence changes at ^)edfic locations may also be 

1 0 revealed by nuclease protection assays, such as RNase and S 1 protection or the chemical 
cleavage method. See Cotton et al, , Froc Nail Acad Sci USA (1985) 85: 4397-4401 . In 
another embodiment, an array of oligonucleotides probes comprising NrCAMvar 
nucleotide sequence or fragments thereof can be constructed to conduct efficient screening 
of e.g., genetic mutations. Array technology methods are well known and have general 

15 applicability and can be used to address a variety of questions in molecular genetics 
including gene expression, genetic linkage, and genetic variability. (See for example: 
M.Chee et al.. Science, Vol 274, pp 610-613 (1996)). 

The diagnostic assays offer a process for diagnosing or determining a suscqstibility to 
Diab^, obesity and cancer through detection of mutation in the NrCAMvar gene by the 

20 m^hods described 

In addition. Diabetes, obesity and cancer, can be diagnosed by methods comprising 
determining from a sample derived from a subjea an abnormally decreased or increased 
level of NrCAMvar polypeptide or NrCAMvar mRNA. Decreased or increased expression 
can be measured at the RN A level using any of the methods well known in the art for the 

25 quantitation of polynucleotides, such as, for example, PCR, RT-PCR, RNase protection. 
Northern blotting and other hybridization methods. Assay tedmiques that can be used to 
determine levels of a protdn, such as an NrCAMvar polypeptide, in a sample derived fixim a 
host are well-known to those of skill in the art Such assay methods include 
radioimmurKXissays, competitive-biiKiing assays. Western Blot analysis and FT .ISA assays. 

30 

Chromosome Assays 

The nucleotide sequences of the present invention are also valuable for chromosome 
identificatioa The sequence is specifically targeted to and can hybridize with a particular 
location on an individual human chromosome. The mapping of relevant sequences to 
35 chromosomes according to the present inveruion is an impcrtant first step in correlating those 
sequences with gene associated disease. Once a sequence has been mapped to a precise 

11 
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chrOTiosomaJ location, tlK physical position of the sequence on the chmnxKonK can be 
correlated with gen^cniap data. Such data are found, for example, in V. McKusick. 
Mendelian Inheritance in Man (avaOable on line through Johns HopkiiB University Welch 
Medical library). The relationship between genes and diseases that have been nuRjed to the 
5 san)e chronwsomal region are then identified through linkage analysis (coinheritance of 
physically adjacent genes). 

The differences in the cDNA or genomic sequence between affected and unaffected 
individuals can also be determined. If a mutation is observed in some or all of the affected 
individuals but not in any normal individuals, then the mutation is likely to be the causative 
1 0 agent of the disease. 

Antibodies 

The polypeptides of the invention or their fragments or analogs thereof, or cdls 
expressing them can also be used as immunogens to produce antibodies immunospedfic for the 

15 NrC AM var polypeptides. Tte term "immunospedfic" means that the antibodies have 

substantial] greater affinity for the polypeptides of the invention than thdr affinity for other 
related polypeptides in the prior art 

Antibodies generated against the NiCAMvar polypeptides can be obtained by 
administering the polypeptides or epitope-bearing fragnnents, analogs or cells to an animal, 

20 preferably a nonhuman, using routine protocols. For pre>aration of monoclonal antibodies, any 
technique which provides antibodies produced by continuous cell line cultures can be used 
Examples include the hybridoma technique (KoWer, G. and Milstein, C, Nature (1975) 
256:495-497), the trioma technique, the human B-cell hybridoma technique (Kozbor etoL, 
Immunology Today (1983) 4:72) and the EBV-hybridoma technique (Cde et oL, 

25 MONOCLONAL ANTIBODIES AND CANCER TOERAPY, pp. 77-96, Alan R. Liss, Inc., 
1985). 

Tediniques for the production of single chain antibodies (U.S. Patent Na 4,946,778) 
can also be adapted to produce single chain antibodies to polypq)tides of this invaitioa Also, 
transgenic mice, or other crganisms including other mammals, may be used to express 
30 humanized antibodies. 

The above-described antibodies may be employed to isolate or to identify clones 
exiM-essing the polypepUde or to purify the polypeptides by affinity chromatography. 

Antibodies against NiCAMvar polypeptides may also be en^loyed to treat Diab^, 
obesity and cancer, among others. 

35 . 

Vacdnes 
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Another aspea of the invention relates to a method for inducing an immunological 
response in a manmial which comprises inoculating the mammal with NrCAMvar 
polypeptide, or a firagmenl thereof, adequate to produce antibody and/or T cell immune 
response to protect said animal from Diab^, obesity and cancer, among others. Yet 

5 another aspect of the invention relates to a method of inducing immunological response in 
a manunal which comprises, delivering NrCAMvar polypeptide via a vector directing 
expression of NrCAMvar polynucleotide in vivo in order to induce such an inmiunological 
response to produce antibody to protect said animal from diseases. 

Further aspect of the invention relates to an inununological/vaccine formulation 

10 (composition) which, when introduced into a manunalian host, induces an immunological 
response in that mammal to a NrCAMvar polypeptide wherein the composition comprises a 
NrCAMvar polypeptide or NrCAMvar gene. The vaccine formulation may further 
comprise a suitable carrier. Since NrCAMvar polypeptide may be broken down in the 
stomach, it is preferably administered parenterally (including subcutiuieous, intramuscular, 

15 intravenous, intradermal etc. injection). Formulations suitable for parenteral 

administration include aqueous and non-aqueous sterile injection solutions which may 
contain antioxidants, buffers, bactenostats and solutes which render the formulation 
instonic with the blood of the recipient; and aqueous and non-aqueous sterile suspensions 
which may include suspending agents or thickening agents. Hie formulations may be 

20 presented in unit-dose or multi-dose containers, for example, sealed ampoules and vials 
and may be stored in a freeze-dried condition requiring only the addition of the sterile 
liquid carrier immediately prior to use. The vaccine formulation may also include adjuvant 
systems for enhancing the immunogenicity of the formulation, such as oil-in water systems 
and other systems known in the art. The dosage will depend on the specific activity of the 

25 vaccine and can be readily determined by routine experimentatioiL 

Screening Assays 

The NrCAMvar polypeptide of the presertt invention may be employed in a screening 
process fey compounds which activate (agonists) <x inhibit activation of (antagonists, or 

30 otherwise called inhibitor) the NrCAMvar polypeptide of the present invention. Thus, 

polypeptides of the invention may also be used to assess identify agonist or antagonists from, for 
example, cells, cell-free preparations, chemical libraries, and natural produa rmxtures. These 
agonists or antagonists may be natural substrates, ligands, receptors, etc., as tiie case may be, of 
the polypeptide of tiie present invention; (x may be structural (x functioruil mimetics of the 

35 polypeptide of the present inventioa S&^Co^gimetaL, Current Protocols in Immunology 
l(2);Chapter5(1991). 
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NrC AMvar {xjlypeptides are ubiquitous in the mammalian hC3St and are responsible 
many biological functions, including many pathologies. Accordingly, it is desirous to find 
compounds and drugs which stimulate NrCAMvar polypeptide on the one hand and viiiich can 
inhibit the fiinction of NrCAMvar polypeptide on the other hand In geiKraU agonists are 
5 employed for therapeutic and prophylactic purposes for such conditions as Diabetes, obesity 
and cancer. Antagonists may be en5)loyed fa- a vari^ of therapeutic arel prophylactic 
purposes for such conditions as Diabetes, obesity and cancer, 

In general, such screeniiig procedures may involve using appropriate CCDs which 
express the NrCAMvar polypqHide CH- respond to NrCAMvar polypqjtide of the fTCS^ 

10 invention. Sxich cells include cdls from mammals, yeast, DrosophUa or £1 colL Cells v^ch 
express tte NrCAMvar polypeptide (or cell membrane containing the expressed pdypejrtide) cr 
respond to NrCAMvar polypeptide are then contacted with a test compound to observe biwJing, 
or stimulaticm or inhibition of a functiaial response. The ability of the cells v^ch were 
contacted with the candidate conpounds is con^)ared with the same cells w4iich were xxst 

1 5 contacted for NrCAMvar activity. 

The assays may simply test binding of a candidate compound wherein adherence to 
the cells bearing the NrCAMvar polypeptide is detected by means of a label directly or 
indirectly associated with the candidate compound or in an assay involving competition 
with a labeled competitor. Further, tiiese assays may test whether the candidate compound 

20 results in a signal generated by activation of the NrCAMvar polypeptide, using detection 
systems appropriate to the ceUs bearing the NrCAMvar polypeptide. Inhibitors of 
activation are generally assayed in tiie presence of a known agonist and tiie effect on 
activation by the agonist by the presence of the candidate compound is observed. Standard 
methods for conducting such screening assays are well understood in the art 

25 Examples of potential NrCAMvar poIypq)tide antagonists include antibocfies or, in 

some cases, oligonudecHides or proteins wWch are closely related to the ligands, substrates, 
receptors, etc., as tiie case may be, of the NrCAMvar polypeptide, e.g., a fragment of the 
ligands, substrates, recqjtc^s, or small molecules which bind to the polyp^de of the present 
invention but do not elicit a response, so that tiie activity of tiie polypeptide is prevented. 

30 

Prophylactic and Therapeutic Methods 

This invention provides metiiods of treating an abncxmal conditions related to botti an 
excess of and insufficient amounts of NiCAM var polypeptide activity, including dial«es, 
obesity and cancer. 

If ti^ activity of NrCAMvar p olype ptide is in excess , several a p proaches are available. 

One approach comprises administering to a subjea an inhibitor compound (antagonist) as 

14 
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hereinabove described along with a pharmaceutically accqjtable earner in an amount effective 
to inhibit activation by blocking binding of ligands to the NiC AM var polypeptide, cr by 
inhibiting a second signal, and ther^y alleviating the abnormal conditioa 

In another approach, soluble forms of NrCAMvar polypeptides still capable of 
5 binding the ligand in competition with endogenous NrCAMvar polypeptide may be 
administered Typical embodiments of such competitors comprise fragments of the 
NiCAMvar polypeptide. 

In still another approach, expression of the gene encoding endogenous NrCAMvar 
polypeptide can be inhibited using expression blocking techniques. Known such tetimiques 

10 involve the use of antisense sequences, either internally generated or separately 
administered See, fcff example, O'Connor, JNeurochem (1991) 56:560 in 
QligQdeoxvnucleotides as Antisense I nhibitors of Gene Expression . CRC Press. Boca 
Raton, FL (1988). Alternatively, oligonucleotides which form triple helices with the gene 
can be supplied See, for example, Lee et al. Nucleic Acids Res (1979) 6:3073; Cooney 

15 et aL Science (1988) 241:456; Dervan et aL, Science (1991) 251:1360. These oligomers 
can be administered per se or the relevant oligomers can be expressed in vivo. 

Foe treating abncamal conditions related to an under-expression of NrCAMvar and its 
activity, several approaches are also available. One approach comprises administerixig to a 
subjea a therapeutically effecUve amount of a con^xxmd which activates NrCAMvar 

20 polypq)tide, Le., an agonist as described above, in combination with a pharmaceutically 

acceptable carrier, to thereby alleviate the abncrmal conditioa Alternatively, gene therapy may 
be employed to effect the endogenous production of NrCAMvar by the relevant cells in the 
subject For exanple, a polynucleotide of the inveiuion may be engineered fiy expression in a 
replication defective retroviral vectcx, as discussed above. The retroviral expression construct 

25 may then be isolated and introduced into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present invention such that the packaging 
cell now produces infectious viral particles containing the gene of interest These producer 
may be administered to a subject fcH* engineering cells in vivo and expression of the polypq)tide 
in vivo. For overview of gene therapy, see Chapter 20, Gene Therapy and other Molecular 

30 Genetic-based Therapeutic Approaclies^ (and references cited therein) in Human Molecular 
Genetics, T Strachan and A P Read BIOS Scientific Publishers Ltd (1996). 

Formulation and Administration 

Peptides, such as tiie soluble form of NrCAMvar polypqjtides, and agraists and 
35 antagonist peptides or small molecules, may be formulated in combination with a suitable 

pharmaceutical carrier. Such formulations comprise a therapeutically effective amount of the 

15 
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polypeptide ck compound, aiKl a phannacaiticaUy acceptable carrier or exdpienL Such carriers 
include but are not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. Fcrnmlation should suit the mode of administnition, and is within 
iJie skill of the art The invention further relates to pharmacaitical packs and kits comprising 
5 one or more containers filled with one or maie of the ingredients of ttie aforementi(Med 
compositions of the invoitioa 

PoJypeptides and other compouittls of the presem invention niay be employed aloiK 
in conjunction with other compounds, such as therapeutic con^xxinds. 

Prefenred forms of systemic administration of the pharmaceutical conqxjsitions include 

10 injection, typically by intravenous injectioa Other injection routes, such as subcutaneous, 
intramuscular, or intraperitoneal, can be used. Alternative means fior systemic administration 
include transmucosal and transdermal administration using penetrants such as bile salts or 
fusidic adds or other detergents. In addition, if prqierly fcrmulated in enteric or etKajKulated 
formulations, cxxU administration may also be possible. Administration of these compounds 

1 5 may also be tqrical and/or localized, in tiie fmn of salves, pastes, gels and tiie like. 

The dosage range required depends cm the choice of peptide, the route of 
administration, tfie nature of the formulation, the nature of the subjea's condition, and the 
judgment of the attending practitioner. Suitable dosages, however, are in the range of 0. 1-100 
Hg/kg of subject Wide variations in the needed dosage, however, are to be expected in ^ew of 

20 the variety of con^xxinds avaflable and the differing efficiencies of various routes of 

administratioa For exanple, oral administration would be expected to require higher dosages 
than administration by intravenous injectioa Variations in these dosage levels can be adjusted 
using standard empirical routines for optimization, as is well understood in the art 

Polypeptides used in treatment can also be generated endpgenously in the subject, in 

25 treatment modalities often referred to as "gene therapy" as described above. Thus, for example, 
cells from a subject may be engineered witii a polynucleotide, such as a DNA or RNA, to 
encode a polypeptide ex vivo, and fey example, by tiie use of a retroviral plasmid vector. The 
cells are then introduced into the subject 

30 Examples 

The examples bdow are carried out using standard techniques, which are wdl known 
and routine to those of skill in tiie art, except where ottierwise described in detail. The examples 
illustrate, but do not limit the inventioa 

35 ExampleJ 
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rinning of human NrCAMvar cDNA 

The HGS EST database was screened using the chick NiCAM sequence and three HGS 
EST clones (EST99669, EST237133. and EST373834) were obtained. EST99669 and 
EST237133 clones were from human adrenal cDNA Ubrary while EST373834 was ftom 

5 human striatum cDN A library, cDN A clones EST237 133 and EST373834 contained 
several EcoRl or Eco RUXIw I fragments, suggesting that inserts ftom several different 
genes may be present Only those fragments containing the EST sequence homologous to 
NrC AM were subcloned and used fcff further characterisation. These clones were end 
sequenced and used as probes labelled with [a'^'^P](lCTP (Amersham) to screen a human 

10 fetal brain (20-wk) Xgtl 1 cDNA library (Qontech), Four positive cDNA clones were 
isolated and the inserts were cloned into pBluescript plasmids (Maniatis et al., 1982). 
Additional sections of the gene were isolated using gene-specific primers to amplify cDNA 
from the human fetal brain Marathon'^ cDNA (Qontech) and the X.gtl 1 fetal brain 
cDNA library. All sequencing was performed on an ABI373 sequencer using the ABI 

15 PRISM'^ dye terminator cycle sequencing ready reaction kit Sequences were assembled' 
using the Wisconsin GCG package. 

Comparison of both DNA and amino acid sequences for human-NiG AMvar (SEQ ID 
N0:1) and the published NiCAM sequence (Lane et al) revealed that they were >99% 

20 identical in overlapping regions. The DNA sequence of human NiCAMvar of SEQ ID 
N0:1 is 77.1% identical to that of chick gene while the amino acid sequence (SEQ ID 
N0:2) is 80% identical. Evidence for alternative splicing of AE19, AE12 and AE93, was 
obtained either through sequencing of cDNA clones or PGR products from human cDNA 
and in addition two novel regions, AEIOK and AEIOL were found to be differentially 

25 absent in the present (AEIOK) versus Lanes (AEIOL) sequence (see Figure 2), 

a) Exon structure: 

EST99669 was found to contain a contiguous genomic sequence not homologous to any 
sequence of the chick gene. Examination of this sequence revealed a splice doncx- 

30 consensus sequence. This probably corresponds to an intron:exon boundary represented in 
cDNA due to incomplete mRNA processing. A donor and an acceptor were present in 
EST373834. In addition to alternatively spliced regions AE12 and AE93, identified by 
Lane et al (1996), AE19 was also found to be absent from some cDNA fragments 
obtained from human fetal brain cDNA. In addition, two novel alternatively spliced 

35 regions, encoding 10-amino-acid sections, were identified (see Figure 2). AE93 was 
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absent in EST237133 which was from human adrenal cDNA library while both AE12 and 
AE93 were observed in PCR products from human adult brain cDNA. 

b) Results from Northern blots: 

5 A mRNA band of "7.0kb was observed for human brain, placenta, pancreas, adrenal 
medulla and cortex, thyroid, and testis tissues. The results also showed that this gei^ is 
highly expressed in brain, pancreas, and adrenal cortex tissues (the levels of mRNA on the 
blots used are controUed at Qontech and samples are tested for their integrity by 
hybridisation with an actin gene probe). 

10 

c) Chromosomal localisation: 

To obtain precise localisation of the human NrCAM locus . primers from an intron 
(sbpl2) and from an exon (sbpl3) were designed according to the sequence of 
EST373834, Tliese primers produced a PCR product of 214bp, and were used to screen 
15 for the presence of this gene in the Genebridge 4 radiation hy Wd panel. The results were 
analysed by tiie WIGCR experimental mapping server and arc shown in Table 1. Tliese 
data place the human NiCAM gene on the long arm of chromosonie 7 at 7q21-22 between 
D7S666 and D7S658. 

20 Table I: Data vectors obtained from testing Genebridge 4 Radiation Hydrids panel using 
primers sbp 1 2 and 13. 

00000 00100 lOOOO 00010 00000 10110 00001 10000 OOlOO 00021 
00000 01 100 10010 1 1000 01000 1 1 120 01000 210 
00000 00100 10001 00010 00000 10110 00000 00000 01100 00000 

00000 01 100 10010 1 1000 1 1000 11 1 10 01000 010 
00000 00100 10000 00011 00000 10110 00001 10000 01100 00001 

00000 01100 10010 11000 01002 1112001000 210 

Each digit corresponds to one of 93 ceil lines in the radiation hybrid panel Oandl 
represent negative and positive PCR assays respectively, 2 shows that the assay was 
contradictory between duplicate experiments or was untested. 



D7S666 
10101 
25 NrCAM 
10101 

D7S658 
10101 

30 
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Since pancreatic function is intimately involved in the development of diabetes* 
NrC AMvar becomes a target molecule in the management of this disease. This suggestion 
is supported by the genomic mapping data. A locus for non-insulin-dependent diabetes 
melUtus (NIDDM), also called Type II diabetes, has been mapped to the same region of 
5 chromosome 7 as NrC AMvar (Prochazka, 1995). 
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SEQUENCE LISTING 

INFORMATION FOR SEQ ID NO:l: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3997 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

15 



20 



25 



30 



35 



CCATCGTAAT 


TCGCCTAATG 


CAGCTTAAAA 


TAATGCCGAA 


AAAGAAGCGC 


TTATCTGCGG 


60 


GCAGAGTGCC 


CCTGATTCTC 


TTCCTGTGCC 


AGATGATTAG 


TGCACTGGAA 


GTACCTCTTG 


120 


ATCCAAAACT 


TCTTGAAGAC 


TTGGTACAGC 


CTCCAACCAT 


CACCCAACAG 


TCTCCAAAAG 


180 


ATTACATTAT 


TGACCCTCGG 


GAGAATATTG 


TAATCCAGTG 


TGAAGCCAAA 


GGGAAACCGC 


240 


CCCCAAGCTT 


TTCCTGGACC 


CGTAATGGGA 


CTCATTTTGA 


CATCGATAAA 


GACCCTCTGG 


300 


TCACCATGAA 


GCCTGGCACA 


GGAACGCTCA 


TAATTAACAT 


CATGAGCGAA 


GGGAAAGCTG 


360 


AGACCTATGA 


AGGAGTCTAT 


CAGTGTACAG 


CAAGGAACGA 


ACGCGGAGCT 


GCAGTTTCTA 


420 


ATAACATTGT 


TGTCCGCCCA 


TCCAGATCAC 


CATTGTGGAC 


CAAAGAAAAA 


CTTGAACCAA 


480 


TCACACTTCA 


AAGTGGTCAG 


TCTTTAGTAC 


TTCCCTGCAG 


ACCCCCAATT 


GGATTACCAC 


540 


CACCTATAAT 


ATTTTGGATG 


GATAATTCCT 


TTCAAAGACT 


TCCACAAAGT 


GAGAGAGTTT 


600 


CTCAAGGTTT 


GAATGGGGAC 


CTTTATTTTT 


CCAATGTCCT 


CCCAGAGGAC 


ACCCGCGAAG 


660 


ACTATATCTG 


TTATGCTAGA 


TTTAATCATA 


CTCAAACCAT 


ACAGCAGAAG 


CAACCTATTT 


720 


CTGTGAAGGT 


GATTTCAGTG 


GATGAATTGA 


ATGACACTAT 


AGCTGCTAAT 


TTGAGTGACA 


780 


CTGAGTTTTA 


TGGTGCTAAA 


TCAAGTAGAG 


AGAGGCCACC 


AACATTTTTA 


ACTCCAGAAG 


840 


GCAATGCAAG 


TAACAAAGAG 


GAATTAAGAG 


GAAATGTGCT 


TTCACTGGAG 


TGCATTGCAG 


9.00 


AAGGACTGCC 


TACCCCAATT 


ATTTACTGGG 


CAAAGGAAGA 


TGGAATGCTA 


CCCAAAAACA 


960 


GGACAGTTTA 


TAAGAACTTT 


GAGAAAACCT 


TGCAGATCAT 


TCATGTTTCA 


GAAGCAGACT 


1020 


CTGGAAATTA 


CCAATGTATA 


GCAAAAAATG 


CATTAGGAGC 


CATCCACCAT 


ACCATTTCTG 


1080 


TTAGAGTTAA 


AGCGGCTCCA 


TACTGGATCA 


CAGCCCCTCA 


AAATCTTGTG 


CTGTCCCCAG 


1140 


GAGAGGATGG 


GACCTTGATC 


TGCAGAGCTA 


ATGGCAACCC 


CAAACCCAGA 


ATTAGCTGGT 


1200 


TAACAAATGG" 


"A'GTCCCAATA" 


'GAAATTGCCC" 


CTGATGACCC 


CAGCAGAAAA 


ATAGATGGCG 


1260 
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ATACCATTAT TTTTTCAAAT GTTCAAGAAA GATCAAGTGC AGTATATCAG TGCAATGCCT 1320 
CTAATGAATA TGGATATTTA CTGGCAAACG CATTTGTAAA TGTGCTGGCT GAGCCACCAC 1380 
GAATCCTCAC ACCTGCAAAC ACACTCTACC AGGTCATTGC AAACAGGCCT GCTTTACTAG 144 0 
ACTGTGCCTT CTTTGGGTCA CCTCTCCCAA CCATCCAGTG GTTTAAAGGA GCTAAAGGAA 1500 
GTGCTCTTCA TGAAGATATT TATGTTTTAC ATGAAAATGG AACTTTGGAA ATTCCTGTGG 1560 
CCCAAAAGGA CAGTACAGGA ACTTATACGT GTGTTGCAAG GAATAAATTA GGGATGGCGA 1620 
AGAATGAAGT TCACTTAGAA ATCAAAGATC CTACATGGAT CGTTAAACAG CCCGAATATG 1680 
CAGTTGTGCA AAGAGGGAGC ATGGTGTCCT TTGAATGCAA AGTGAAACAT GATCACACCT 1740 
TATCCCTCAC TGTCCTGTGG CTGAAGGACA ACAGGGAACT GCCCAGTGAT GAAAGGTTCA IB 00 
CTGTTGACAA GGATCATCTA GTGGTAGCTG ATGTCAGTGA CGATGACAGC GGGACCTACA 1860 
CGTGTGTGGC CAACACCACT CTGGACAGCG TCTCCGCCAG CGCTGTGCTT AGCGTTGTTG 1920 
CTCCTACTCC AACTCCAGCT CCCGTTTACG ATGTCCCAAA TCCTCCGCTT GACTTAGAAC 1980 
TGACAGATCA ACTTGACAAA AGTGTTCAGC TGTCATGGAC CCCAGGCGAT GACAACAATA 2040 
GCCCCATTAC AACAATTCAT GACGAATATG AAGATGCAAT GCACAAGCCA GGGCTGTGGC 2100 
ACCACCAAAC TGAAGTTTCT GGAACACAGA CCACAGCCCA GCTGAAGCTG TCTCCTTACG 2160 
TGAACTACTC CTTCCGCGTG ATGGCAGTGA ACAGCATTGG GAAGAGCTTG CCCAGCGAGG 2220 
CCTCTGAGCA GTATTTGACG AAAGCCTCAG AACCAGATAA AAACCCCACA GCTGTGGAAG 2280 
GACTGGGATC AGAGCCTGAT AATTTGGTGA TTACGTGGAA GCCCTTGAAT GGTTTCGAAT 2340 
TTAATGGGCC AGGCCTTCAG TACAAAGTTA GCTGGCGCCA GAAAGTTGGT GATGATGAAT 2400 
GGACATCTGT GGTTGTGGCA AATGTATCCA AATATATTGT TTCAGGCACG CCAACCTTTG 2460 
TTCCATACCT GATCAAAGTT CAGGCCCTGA ATGACATGGG GTTTGCCCCC GAGCCAGCTG 2 520 
TAGTCATGGG ACATTCTGGA GAAGACCTCC CAATGGTGGC TCCTGGGAAC GTGCGTGTGA 2 580 
ATGTGGTGAA CAGTACCTTA GCCGAGGTGC ACTGGGACCC AGTACCTCTG AAAAGCATCC 2640 
GAGGACACCT ACAAGGCTAT CGGATTTACT ATTGGAAGAC CCAGAGTTCA TCTAAAAGAA 2700 
ACAGACGTCA CATTGAGAAA AAGATCCTCA CCTTCCAAGG CAGCAAGACT CATGGCATGT 2760 
TGCCGGGGCT AGAGCCCTTT AGCCACTACA CACTGAATGT CCGAGTGGTC AATGGGAAAG 2 820 
GGGAGGGCCC AGCCAGCCCT GACAGAGTCT TTAATACTCC AGAAGGAGTC CCCAGCGTTC 2 880 
CCTCGTCTTT GAAGATTGTG AATCCAACAC TGGACTCTCT CACTTTGGAA TGGGATCCAC 2940 
CGAGCCACCC GAATGGCATT TTGACAGAGT ACACCTTAAA GTATCAGCCA ATTAACAACA 3000 
CACATGAATT AGGCCCTCTG GTAGATTTGA AAATTCCTGC CAACAAGACA CGGTGGACTT 3060 
TAAAAAAITT AAATTTCACC ACTCGATATA AGTTTTATTT CTATGCACAA ACATCAGCAG 3120 
GATCAGGAAG TCAAATTACA GAGGAAGCAG TAACAACTGT GGATGAAGCT GGTATTCTTC 3180 
CACCTGATGT AGGTGCAGGC AAAGTTCAAG CAGTAAATCC CAGGATCAGC AATCTTACTG 324 0 
CTGCAGCTGC TGAAACCTAT GCCAATATCA GTTGGGAATA TGAGGGACCA GAGTATGCCA 3300 
ACTTTTATGT TGAATATGGT GTAGCAGGCA GCAAAGAAGA ATGGAGAAAA GAAATTGTAA 3360 
ATGGTTCTCG GAGCTTCTTT GGGTTAAAGG GTCTAATGCC AGGAACAGCA TACAAGTTTC 3 4 20 
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GAGTTGGTGC TGTGGGGGGA CCCCGGTTTG TGAGTTCAGA GGGTGTGTTT GAGACAGGCC 3480 

CAGCGATGGC AAGCCGGCAG GTGGATATTG CAACTCAGGG CTGGTTCATT GGTCTGATGT 3540 

GTGCTGTTGC TCTCCTTATC TTAATTTTGC TGATTGTTTG CTTCATCAGA AGAAACAAGG 3600 

GTGGTAAATA TCCAGTTAAA GAAAAGGAAG ATGCCCATGC TGACCCTGAA ATCCAGCCTA 3660 

TGAAGGAAGA TGATGGGACA TTTGGAGAAT ACAGTGATGC AGAAGACCAC AAGCCTTTGA 3720 

AAAAAGGAAG TCGAACTCCT TCAGACAGGA CTGTGAAAAA AGAAGATAGT GACGACAGCC 3780 

TACTTGACTA TGGAGAAGGG GTTAATGGCC AGTTCAATGA GGATGGCTCC TTTATTGGAC 3840 

AATACAGTGG TAAAAAAGAG AAAGAGCCGG CTGAAGGAAA CGAAAGCTCA GAGGCACCTT 3900 

CtCCTGTCAA CGCCATGAAT TCCTTTGTTT AATCATAGAA CTTGATTCCG ATGATGTCTT 3960 

TACAGTTTGT TTGCTATTGT CCATCCAGGT TGTACTG 3997 

INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1304 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gin Leu Lys lie Met Pro Lys Lys Lys Arg Leu Ser Ala Gly Arg 

15 10 15 

Val Pro Leu lie Leu Phe Leu Cys Gin Met He Ser Ala Leu Glu Val 

20 25 30 

Pro Leu Asp Pro Lys Leu Leu Glu Asp Leu Val Gin Pro Pro Thr He 

35 40 45 

Thr Gin Gin Ser Pro Lys Asp Tyr He He Asp Pro Arg Glu Asn He 

50 55 60 

Val He Gin Cys Glu Ala Lys Gly Lys Pro Pro Pro Ser Phe Ser Trp 

65 70 75 80 

Thr Arg Asn Gly Thr His Phe Asp He Asp Lys Asp Pro Leu Val Thr 

85 90 95 

"Met~Lys~Pr%5~Glv^Tfir~Giy~Thr^LW"Ile~Ile Asn He Met Ser Glu Gly 
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100 105 110 

Lys Ala Glu Thr Tyr Glu Gly Val Tyr Gin Cys Thr Ala Arg Asn Glu 

115 120 125 

Arg Gly Ala Ala Val Ser Asn Asn He Val Val Arg Pro Ser Arg Ser 

130 135 140 

Pro Leu Trp Thr Lys Glu Lys Leu Glu Pro He Thr Leu Gin Ser Gly 
145 150 155 160 

Gin Ser Leu Val Leu Pro Cys Arg Pro Pro lie Gly Leu Pro Pro Pro 

165 170 175 

He He Phe Trp Met Asp Asn Ser Phe Gin Arg Leu Pro Gin Ser Glu 

180 185 190 

Arg Val Ser Gin Gly Leu Asn Gly Asp Leu Tyr Phe Ser Asn Val Leu 

195 200 205 

Pro Glu Asp Thr Arg Glu Asp Tyr He Cys Tyr Ala Arg Phe Asn His 

210 215 220 

Thr Gin Thr He Gin Gin Lys Gin Pro He Ser Val Lys Val He Ser 
225 230 235 240 

Val Asp Glu Leu Asn Asp Thr He Ala Ala Asn Leu Ser Asp Thr. Glu 

245 250 255 

Phe Tyr Gly Ala Lys Ser Ser Arg Glu Arg Pro Pro Thr Phe Leu Thr 

260 265 270 

Pro Glu Gly Asn Ala Ser Asn Lys Glu Glu Leu Arg Gly Asn Val Leu 

275 280 285 

Ser Leu Glu Cys He Ala Glu Gly Leu Pro Thr Pro He He Tyr Trp 

290 295 300 

Ala Lys Glu Asp Gly Met Leu Pro Lys Asn Arg Thr Val Tyr Lys Asn 
305 310 315 320 

Phe Glu Lys Thr Leu Gin He He His Val Ser Glu Ala Asp Ser Gly 

325 330 335 

Asn Tyr Gin Cys He Ala Lys Asn Ala Leu Gly Ala He His His Thr 

340 345 350 

He Ser Val Arg Val Lys Ala Ala Pro Tyr Trp He Thr Ala Pro Gin 

355 360 365 

Asn Leu Val Leu Ser Pro Gly Glu Asp Gly Thr Leu He Cys Arg Ala 

370 375 380 

Asn Gly Asn Pro Lys Pro Arg He Ser Trp Leu Thr Asn Gly Val Pro 
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385 390 395 400 

lie Glu lie Ala Pro Asp Asp Pro Ser Arg Lys He Asp Gly Asp Thr 

405 410 415 

He He Phe Ser Asn Val Gin Glu Arg Ser Ser Ala Val Tyr Gin Cys 

420 425 430 

Asn Ala Ser Asn Glu Tyr Gly Tyr Leu Leu Ala Asn Ala Phe Val Asn 

435 440 445 

Val Leu Ala Glu Pro Pro Arg He Leu Thr Pro Ala Asn Thr Leu Tyr 

450 455 
Gin Val He Ala Asn Arg Pro Ala Leu Leu Asp Cys Ala Phe Phe Gly 
465 470 475 480 

Ser Pro Leu Pro Thr He Gin Trp Phe Lys Gly Ala Lys Gly Ser Ala 

485 490 495 

Leu His Glu Asp He Tyr Val Leu His Glu Asn Gly Thr Leu Glu He 

500 505 510 

Pro Val Ala Gin Lys Asp Ser Thr Gly Thr Tyr Thr Cys Val Ala Arg 

515 520 525 

Asn Lys Leu Gly Met Ala Lys Asn Glu Val His Leu Glu He Lys Asp 

530 535 
Pro Thr Trp He Val Lys Gin Pro Glu Tyr Ala Val Val Gin Arg Gly 
545 550 555 560 

Ser Met Val Ser Phe Glu Cys Lys Val Lys His Asp His Thr Leu Ser 

565 570 575 

Leu Thr Val Leu Trp Leu Lys Asp Asn Arg Glu Leu Pro Ser Asp Glu 

580 585 590 

Arg Phe Thr Val Asp Lys Asp His Leu Val Val Ala Asp Val Ser Asp 

595 600 605 

Asp Asp Ser Gly Thr Tyr Thr Cys Val Ala Asn Thr Thr Leu Asp Ser 

610 615 620 

Val Ser Ala Ser Ala Val Leu Ser Val Val Ala Pro Thr Pro Thr Pro 
625 630 635 640 

Ala Pro Val Tyr Asp Val Pro Asn Pro Pro Leu Asp Leu Glu Leu Thr 

645 650 655 
Asp Gin Leu Asp Lys Ser Val Gin Leu Ser Trp Thr Pro Gly Asp Asp 
660 665 670 



Asn Asn Ser Pro He Thr Thr He His Asp Glu Tyr Glu Asp Ala Met 
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675 680 685 

His Lys Pro Gly Leu Trp His His Gin Thr Glu Val Ser Gly Thr Gin 

690 695 700 

Thr Thr Ala Gin Leu Lys Leu Ser Pro Tyr Val Asn Tyr Ser Phe Arg 
705 710 715 720 

Val Met Ala Val Asn Ser lie Gly Lys Ser Leu Pro Ser Glu Ala Ser 

725 730 735 

Glu Gin Tyr Leu Thr Lys Ala Ser Glu Pro Asp Lys Asn Pro Thr Ala 

740 745 750 

Val Glu Gly Leu Gly Ser Glu Pro Asp Asn Leu Val" lie Thr Trp Lys 

755 760 765 

Pro Leu Asn Gly Phe Glu Phe Asn Gly Pro Gly Leu Gin Tyr Lys Val 

770 775 780 

Ser Trp Arg Gin Lys Val Gly Asp Asp Glu Trp Thr Ser Val Val Val 
785 790 795 800 

Ala Asn Val Ser Lys Tyr lie Val Ser Gly Thr Pro Thr Phe Val Pro 

805 810 815 

Tyr Leu lie Lys Val Gin Ala Leu Asn Asp Met Gly Phe Ala Pro Glu 

820 825 830 

Pro Ala Val Val Met Gly His Ser Gly Glu Asp Leu Pro Met Val Ala 

835 840 845 

Pro Gly Asn Val Arg Val Asn Val Val Asn Ser Thr Leu Ala Glu Val 

850 855 860 

His Trp Asp Pro Val Pro Leu Lys Ser lie Arg Gly His Leu Gin Gly 
865 870 875 880 

Tyr Arg lie Tyr Tyr Trp Lys Thr Gin Ser Ser Ser Lys Arg Asn Arg 

885 . 890 895 

Arg His lie Glu Lys Lys He Leu Thr Phe Gin Gly Ser Lys Thr His 

900 905 910 

Gly Met Leu Pro Gly Leu Glu Pro Phe Ser His Tyr Thr Leu Asn Val 

915 920 925 

Arg Val Val Asn Gly Lys Gly Glu Gly Pro Ala Ser Pro Asp Arg Val 

930 935 940 

Phe Asn Thr Pro Glu Gly Val Pro Ser Val Pro Ser Ser Leu Lys lie 
945 950 955 960 

Val Asn Pro Thr Leu Asp Ser Leu Thr Leu Glu Trp Asp Pro Pro Ser 
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965 970 975 

His Pro Asn Gly He Leu Thr Glu Tyr Thr Leu Lys Tyr Gin Pro He 

980 985 990 

Asn Asn Thr His Glu Leu Gly Pro Leu Val Asp Leu Lys He Pro Ala 

995 1000 1005 

Asn Lys Thr Arg Trp Thr Leu Lys Asn Leu Asn Phe Thr Thr Arg Tyr 

1010 1015 1020 

Lys Phe Tyr Phe Tyr Ala Gin Thr Ser Ala Gly Ser Gly Ser Gin He 
1025 1030 1035 104 

Thr Glu Glu Ala Val Thr Thr Val Asp Glu Ala Gly He Leu Pro Pro 

1045 1050 1055 

Asp Val Gly Ala Gly Lys Val Gin Ala Val Asn Pro Arg He Ser Asn 

1060 1065 1070 

Leu Thr Ala Ala Ala Ala Glu Thr Tyr Ala Asn He Ser Trp Glu Tyr 
15 1075 1080 1085 

Glu Gly Pro Glu Tyr Ala Asn Phe Tyr Val Glu Tyr Gly Val Ala Gly 

1090 1095 1100 

Ser Lys Glu Glu Trp Arg Lys Glu He Val Asn Gly Ser Arg Ser Phe 
1105 1110 1115 

Leu Lys Gly Leu Mec Pro Gly Thr Ala Tyr Lys Phe Arg Val 
1125 1130 1135 

Gly Ala Val Gly Gly Pro Arg Phe Val Ser Ser Glu Gly Val Phe Glu 

1140 1145 1150 

Thr Gly Pro Ala Met Ala Ser Arg Gin Val Asp He Ala Thr Gin Gly 
25 1155 1160 1165 

Trp Phe He Gly Leu Met Cys Ala Val Ala Leu Leu He Leu He Leu 

1170 1175 1180 

Leu He Val Cys Phe He Arg Arg Asn Lys Gly Gly Lys Tyr Pro Val 
1185 1190 1195 120 

Lys Glu Lys Glu Asp Ala His Ala Asp Pro Glu He Gin Pro Met Lys 

1205 1210 1215 

Glu Asp Asp Gly Thr Phe Gly Glu Tyr Ser Asp Ala Glu Asp His Lys 

1220 1225 1230 
Pro Leu Lys Lys Gly Ser Arg Thr Pro Ser Asp Arg Thr Val Lys Lys 
1235 1240 ^1-245 
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Glu Asp Ser Asp Asp Ser Leu Leu Asp Tyr Gly Glu Gly Val Asn Gly 
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1250 125S 1260 

Gin Phe Asn Glu Asp Gly Ser Phe He Gly Gin Tyr Ser Gly Lys Lys 
1265 1270 1275 128 

Glu Lys Glu Pro Ala Glu Gly Asn Glu Ser Ser Glu Ala Pro Ser Pro 

1285 1290 1295 

Val Asn Ala Met Asn Ser Phe Val 
1300 
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What is claimed is: 



1 . An isolated polynucleotide comprising a nucleoUde sequence 
encoding the NrCAMvar polypepdde of SEQ ID:N02; or a nudeoUde sequence 
5 complementary to said nucleotide sequence, 

2. The polynucleotide of claim 1 which is DN A or RNA, 

3. The polynucleotide of claim 2 wherein said nucleotide sequence comprises 
1 0 the NiCAM var polypeptide encoding sequence contained in SEQ ID:N02. 

4. The polynucleotide of SEQ ID NO: 1. 

5. A polynucleotide probe or primer comprising at least 15 contiguous 
15 nucleotides of the polynucleotide of claim 3. 



6. A DNA or RNA molecule comprising an expression system, wherein said 
expression system is capable of producing a NrCAMvar poIypq)tide conq>rTsing an amino 
acid sequence of SEQ ID:N02 when said expression system is present in a compatible host 
20 ceU. 



7. A host cell comprising the expression system of claim 7. 

8. A process for producing a NrCAMvar polypepUde comprising culturing a 
25 host of claim 7 and under condi tions sufficient for the production of said polypeptide. 

9. The process of claim 8 which further includes recovering the polypeptide 
from the culture. 



10. A process for producing a cell which produces a NrCAMvar polypeptide 
thereof comprising transforming or transfecting a host cell with the expression system of 
claim 6 such that the host cell, under appropriate culture conditions, produces a 
NrCAMvar polypeptide. 

1 1. Cells produced b y the proce5K of claim in. 
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12. A polypeptide which comprises the amino acid sequence of SEQ ID 

NO:2. 

13. Hie polypeptide encoded in SEQ ID:N02. 

5 

14. A NrCAMvar polypepUde prqjared by the method of claim 9. 

15. An antibody immunospecific for tl^ NrCAMvar polypeptide of claim 12. 

10 16, A method for the treatment of a subject in need of enhanced NrCAMvar 

polypeptide activity comprising: 

(a) administering to the subject a therapeutically effective amount of an 
agonist to said polypeptide; and/or 

(b) providing to the subjea NrCAMvar polynucleotide in a fom so as to efiBect 
15 production of said polypeptide activity in vivo, 

17. A method for the treatment of a subject having need to inhibit NrCAMvar 
polypeptide activity comprising: 

(a) administering to the subject a therapeutically effective amount of an 
20 antagonist to said polypeptide; and/or 

(b) administering to the subject a nucleic acid molecule that inhibits the 
expression of the nucleotide sequence encoding said polypeptide; and/or 

(c) administering to the subject a therapeutically effective amount of a 
polypeptide that competes with said polypeptide for its ligand, substrate , or receptor. 

25 

18. A process for diagnosing a disease or a susceptibility to a disease in a 
subject related to expression or activity of NrCAMvar polypeptide in a subject comprising: 

(a) determining the presence or absence of a mutation in the nucleotide 
sequence encoding said NrCAMvar polypeptide in the genome of said subject; and/OT 
30 (b) analyzing for the presence or amount of the NrCAMvar polypeptide 

expression in a sample derived from said subject 

19. A method for identifying compounds which inhibit (antagonize) or agonize 
the NrCAMvar polypeptide which comprises: 
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(a) contacting a candidate compound with ceUs which express the NrCAMvar 
polypepdde (or ceU membrane expressing NiC AM var pdypepiide) or respond to NiCAMvar 
polypeptide; and 

(b) observing the binding, or stimulation cr inhibition of a functional response; or 
5 comparing the abiUty of the cells (or ceU membrane) wWch were contacted with ite 

compounds with the same cells which were not contacted for NiCAMvar polypeptide activity. 

20. An agonist identified by the method of claim 19. 
10 21 , An antagonist identified by the method of claim 19. 

22. A polynucleotide consisting essentially of a DNA sequence obtainable by 
screening an appropriate library containing the NrCAMvar gene under stringent 
hybridization conditions with a probe having the sequence of SEQ ID:NO 1 or a fragment 

1 5 thereof; and isolating said DNA sequence. 

23, A polypeptide obtainable by expressing a nucleotide sequence comprising 
that of SEQ ID NO;l 

2"^- ^ metiiod for the treatment of diabetes, obesity or cancer which comprises 
administering to tiie subject a therapeutically effective amount of a modulator of 
NiCAMvar polypeptide activity. 

25. A process according to claim 20 for diagnosing presence of or 
25 susceptibility to diabetes, obesity or cancer. 
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Figure 1 DN A sequence and amino acid sequence of NiCAMvar (SEQ ID NOS : 1 and 2 
respectively 

5 SEQ ID NO: 1 

CCATCGTAATTCGCCTAATGCAGCTTAAAATAATGCCGAAAAAGAAGCGCTTATCTGCGG 
GCAGAGTGCCCCTGATTCTCTTCCTGTgCCAGATGATTAGTGCACTGGAAGTACCTCTTG 
ATCCAAAACTTCTTGAAGACTTGGTACAGCCTCCAACCATCACCCAACAGTctccAAAAG 
ATTACATTATTGACCCTCGGGAGAATATTGTAATCCAGTGTGAAGCCAAAGGGAAACCGC 

1 0 CCCC AAGCTTTTCCTGG ACCCGT AATGGG ACTC ATTTTG AC ATCG AT AAAG ACCCTCTGG 
TCACCATGAAGCCTGGCACAGGAACGCTCATAATTAACATCATGAGCGAAGGGAAAGCTG 
AGACCTATGAAGGAGTCTATCAGTGTACAGCAAGGAACGAACGCGGAGCTGCAGTTTCTA 
ATAACATTGTTGTCCGCCCATCCAGATCACCATTGTGGACCAAAGAAAAACTTGAACCAA 
TCACACTTCAAAGTGGTCAGTCTTTAGTACTTCCCTGCAGACCCCCAATTGGATTACCAC 

1 5 CACCTATAATATTTTGGATGG ATAATTCCTTTCAAAG ACTTCCAC AAAGTGAGAGAGTTT 
CTCAAGGTTTGAATGGGGACCTTTATTTTTCCAATGTCCTCCCAGAGGACACCCGCGAAG 
ACTATATCTGTTATGCTAGATTTAATCATACTCAAACCATACAGCAGAAGCAACCTATTT 
CTGTGAAGGTGATTTCAGTGGATGAATTGAATGACACTATAGCTGCTAATTTGAGTGACA 
- CTGAGTTTTATGGTGCTAAATCAAGTAGAGAGAGGCCACCAACATTTTTAACTCCAGAAG 

20 

GCAATGCAAGTAACAAAGAGGAATTAAGAGGAAATGTGCTTTCACTGGAGTGCATTGCAG 
AAGGACTGCCTACCCCAATTATTTACTGGGCAAAGGAAGATGGAATGCTACCCAAAAACA 
GGACAGTTTATAAGAACTTTGAGAAAACCTTGCAGATCATTCATGTTTCAGAAGCAGACT 
CTGGAAATTACCAATGTATAGCAAAAAATGCATTAGGAGCCATCCACCATACCATTTCTG 
TTAGAGTTAAAGCGGCTCCATACTGGATCACAGCCCCTCAAAATCTTGTGCTGTCCCCAG 

25 GAGAGGATGGGACCTTGATCTGCAGAGCTAATGGCAACCCCAAACCCAGAATTAGCTGGT 
TAACAAATGGAGTCCCAATAGAAATTGCCCCTGATGACCCCAGCAGAAAAATAGATGGCG 
ATACCATTATTTTTTCAAATGTTCAAGAAAGATCAAGTGCAGTATATCAGTGCAATGCCT 
CTAATGAATATGGATATTTACTGGCAAACGCATTTGTAAATGTGCTGGCTGAGCCACCAC 
GAATCCTCACACCTGCAAACACACTCTACCAGGTCATTGCAAACAGGCCTGCTTTACTAG 

30 ACTGTGCCTTCTTTGGGTCACCTCTCCCAACCATCCAGTGGTTTAAAGGAGCTAAAGGAA 
GTGCTCTTCATGAAGATATTTATGTTTTACATGAAAATGGAACTTTGGAAATTCCTGTGG 
CCCAAAAGGACAGTACAGGAACTTATACGTGTGTTGCAAGGAATAAATTAGGGATGGCGA 
AGAATGAAGTTCACTTAGAAATCAAAGATCCTACATGGATCGTTAAACAGCCCGAATATG 
CAGTTGTGCAAAgAGGGAGCATGGTGTCCTTTGAATGCAAAGTGAAACATGATCACACCT 

35 TATCCCTCACTGTCCTGTGGCTGAAGGACAACAGGGAACTGCCCAGTGATGAAAGGTTCA 
CTGTTGACAAGGATCATCTAGTGGTAGCTGATGTCAGTGACGATGACAGCGGGACCTACA 
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CTCCTACTCCAAOTCCAGCTCCCGTTTACGATGTCCCAAATCCTCCGCTTGACITAGAA^ 
TGACAGATCAACTTGACAAAAGTGTTCAGCTGTCATGGACCCCAGGCGATGACAACAATA 
GCCCCATTACAACAATTCATGACGAATATGAAGATGCAATGCACAAGCCAGGGCTGTGGC 
ACCACCAAACTGAAGTTTCTGGAACACAGACCACAGCCCAGCTGAAGCTGTCTCCTTACG 
5 TGAACTACTCCTTCCGCGTGATGGCAGTGAACAGCATTGGGAAGAGCTTGCCCAGCGAGG 
CCTCTGAGCAGTATTTGACGAAAGCCTCAGAACCAGATAAAAACCCCACAGCTGTGGAAG 
GACTGGGATCAGAGCCTGATAATTTGGTGATTACGTGGAAGCCCTTGAATGGTTTCGAAT 
TTAATGGGCCAGGCCTTCAGTACAAAGTTAGCTGGCGCCAGAAAGTTGGTGATGATGAAT 
GGACATCTGTGGTTGTGGCAAATGTATCCAAATATATTGTTTCAGGCACGCCAACCTTTG 
10 TTCCATACCTGATCAAAGTTCAGGCCCTGAATGACATGGGGTTTGCCCCCGAGCCAGCTG 
TAGTCATGGGACATTCTGGAGAAGACCTCCCAATGGTGGCTCCTGGGAACGTGCGTGTGA 
ATGTGGTGAACAGTACCTTAGCCGAGGTGCACTGGGACCCAGTACCTCTGAAAAGCATCC 
GAGGACACCTACAAGGCTATCGGATTTACTATTGGAAGACCCAGAGTTCATCTAAAAGAA 
ACAGACGTCACATTGAGAAAAAGATCCTCACCTTCCAAGGCAGCAAGACTCATGGCATGT 

15 TGCCGGGGCTAGAGCCCTTTAGCCACTACACACTGAATGTCCGAGTGGTCAATGGGAAAG 
GGGAGGGCCCAGCCAGCCCTGACAGAGTCTTTAATACTCCAGAAGGAGTCCCCAGCGTTC 
CCTCGTCTTTGAAGATTGTGAATCCAACACTGGACTCTCTCACTTTGGAATGGGATCCAC 
CGAGCCACCCGAATGGCATTTTGACAGAGTACACCTTAAAGTATCAGCCAATTAACAACA 
CACATGAATTAGGCCCTCTGGTAGATrrcAAAATTCCTGCCAACAAGACACGGTGGACTT 

20 TAAAAAATTTAAATTTCACCACTCGATATAAGTTTTATTTCTATGCACAAACATCAGCAG 
GATCAGGAAGTCAAATTACAGAGGAAGCAGTAACAACTGTGGATGAAGCTGGTATTCTTC 
CACCTGATGTAGGTGCAGGCAAAGTTCAAGCAGTAAATCCCAGGATCAGCAATCTTACTG 
CTGCAGCTGCTGAAACCTATGCCAATATCAGTTGGGAATATGAGGGACCAGAGTATGCCA 
ACTTTTATGTTGAATATGGTGTAGCAGGCAGCAAAGAAGAATGGAGAAAAGAAATTGTAA 

25 ATGGTTCTCGGAGCTTCTTTGGGTTAAAGGGTCTAATGCCAGGAACAGCATACAAGTTTC 
GAGTTGGTGCTGTGGGGGGACCCCGGTTTGTGAGTTCAGAGGGTGTGTTTGAGACAGGCC 
CAGCGATGGCAAGCCGGCAGGTGGATATTGCAACTCAGGGCTGGTTCATTGGTCTGATGT 
GTGCTGTTGCTCTCCTTATCTTAATTTTGCTGATTGTTTGCrrCATCAGAAGAAACAAOT 
GTGGTAAATATCCAGTTAAAGAAAAGGAAGATGCCCATGCTGACCCTGAAATCCAGCCTA 

30 TGAAGGAAGATGATGGGACATTTGGAGAATACAGTGATGCAGAAGACCACAAGCCTTTGA 
AAAAAGGAAGTCGAACTCCTTCAGACAGGACTGTGAAAAAAGAAGATAGTGACGACAGCC 
TACTTGACTATGGAGAAGGGGTTAATGGCCAGTTCAATGAGGATGGCTCCTTTATTGGAC 
AATACAGTGGTAAAAAAGAGAAAGAGCCGGCTGAAGGAAACGAAAGCTCAGAGGCACCTT 
CTCCTGTCAACGCCATGAATTCCTTTGTTTAATCATAGAACTTGATTCCGATGATGTCTT 

35 

TACAGTTTGTTTGCTATTC3TCCATCCAGGTTGTACTG 
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SEQ ID N0:2 

MQLKIMPKKKRLSAGRVPLILFLCQMISALEVPLDPKLLEDLVQPPTITQQSPKDYIIDP 
5 RENIVIOCEAKGKPPPSFSWTRNGTHFDIDKDPLVTMKPGTGTLIINIMSEGKAETYEGV 
YQCTARNERGAAVSNNIWRPSRSPLWTKEKLEPITLQSGQSLVLPCRPPIGLPPPIIFW 
MDNSFORLPQSERVSQGU^GDLYFSNVLPEDTREDYICYARFNHTQTIQQKQPISVKVIS 
TOEUODTIAANLSDTEFYGAKSSRERPPTFLTPEGNASNKEELRGNVLSLECIAEGLPTP 
1 1 YWAKEDGMLPKNRTVYKNFEKTLOI IHVSEADSGNYQC I AKNALGAIHHTI SVRVKAA 

10 PYWITAPQNLVLSPGEDGTI.ICRANGNPKPRISWLTNGVPIEIAPDDPSRKIEX3DTIIFS 
NVQERSSAVYQCNASNEYGYLLANAFVNVUVEPPRILTPANTLYQVIANRPALLDCAFFG 
S PLPT IQWFKGAKGSALHEDI YVLHENGTLEI PVAQKDSTGT YTCVARNKLGMAKNEVHL 
EIKDPTWIVKQPEYAWQRGSMVSFECKVKHDHTLSLTVLWLKDNRELPSDERFTVDKDH 
LWADVSDDDSGTYTCVANTTLDSVSASAVLSWAPTPTPAPVYDVPNPPLDLELTDQLD 

15 KSVQLSOTPGDDNNSPITTIHDEYEDAMHKPGLVfflHQTEVSGTCnTAQl^LSPYVNYSFR 
VMAVNSIGKSLPSEASEQYLTKASEPDKNPTAVEGLGSEPDNLVITWKPLNGFEFNGPGL 
QYKVSWRQKVGDDEWTSVVVANVSKYIVSGTPTFVPYLIKVQALNDMGFAPEPAVVMGHS 
GEDLPMVAPGNVRVNWNSTLAEVHWDPVPLKSIRGHLQGYRIYYWKTQSSSKRNRRHIE 
KKILTFQGSKTHGMLPGLEPFSHYTLNVRWNGKGEGPASPDRVFNTPEGVPSVPSSLKI 

20 VNPTLDSLTLEWDPPSHPNGILTEYTLKYQPINNrrHELGPLVDLKIPANKTRWTUCN^ 
TTRYKFYFYAQTSAGSGSQITEEAVTTVDEAGILPPDVGAGKVQAVNPRISNLTAAAAET 
YANISWEYEGPEYANFYVEYGVAGSKEEWRKEIVNGSRSFFGLKGLMPGTAYKFRVGAVG 
GPRFVSSEGVFETGPAMASRQVDIATOGWFIGLMCAVALLILILLIVCFIRRNKGGKYPV 
KEKEDAHADPEIQPMKEDDGTFGEYSDAEDHKPLKKGSRTPSDRTVKKEDSDDSLLDYGE 

25 GVNGQFNEDGSFIGQYSGKKEKEPAEGNESSEAPSPVNAMNSFV 

Figure 2. Comparison of human NrCAM. NrCAMvar and chick NiCAM. The 
sequences compared are those of the current invention (Human 1), of Lane et al 
30 (Human2) and chick. The nucleotide numbering refers to that of the Humanl 
sequence, 

1512 AEIOKI 

Humanl GAAGATATTT ATGTTTTACA TGAAAATGGA ACTTTG 

35 Human2 GAAGATATTT ATGTTTTACA TGAAAATGGA ACTTTGCSAAA TCAAAGATGC 
Chick GGAAATGAAT ATGTTTTCCA TGATAATGGA ACCTTG 

Humanl gAAA TTCCTGTCGC CCAAAAGGAC AGTACAGGAA 

Human2 TACATGQATC CTTAAAGAAA TTCCTGTGGC CCAAAAGGAC AGTACAGGAA 
40 Chick ^ GAAA TTCCAGTGGC TCAGAAGGAT AGTACTGGCA 

Humanl CTTATACGTG TGTTGCAAGG AATAAATTAG GGATGGCGAA GAATGAAGTT 

Human2 CTTATACGTG TGTTGCAAGG AATAAATTAG GGATGGCAAA GAATGAAGTT 

Chick CATACACATG TGTTGCAAGG AATAAATTAG GGAAGACGCA AAATGAAGTA 
45 AE10K2 

Humanl CACTTAGAAA TCAAAGATCC TACArGCyiTC CTTAAACAGC CCGAATATGC 

Human2 CACTTA— CAGC CCGAATATGC 

Chick CAACTGGAAO TTAAAGACCC AACGATGATA ATTAAACAGC CACAGTACAA 
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10 



15 



Humanl AGTTGTGCAA AGAGGGAGCA TGGTCrrcCTT TGAATGCAAA GTGAAACATG 

Huinan2 AGTTGTGCAA AGAGGGAGCA TGGTGTCCTT TGAATGCAAA GTGAAACATG 

Chick AGTGATTCAG AGATCTGCCC AGGCTTCATT TGAGTGTGTA ATAAAACATG 

Humanl ATCACACCTT ATCCCTCACT GTCCTGTGGC TGAAGGACAA CAGGGAACTG 

HUman2 ATCACACCTT ATCCCTCACT GTCCTGTGGC TGAAGGACAA CAGGGAACTG 

Chick ATCCTACCTT AATACCAACA GTTATATGGC TGAAAGACAA TAATGAACTA 

Humanl CCCAGTGATG AAAGGTTCAC TGTTGACAAG GATCATCTAG TGGTAGCTGA 

Human2 CCCAGTGATG AAAGGTTCAC TGTTGACAAG GATCATCTAG TGGTAGCTGA 

Chick CCAGATGATG AAAGGTTTCT AGTTGGTAAA GACAACTTGA CCATTATGAA 

Humanl TGTCAGTGAC GATGACAGCG GGACCTACAC GTGTGTGGCC AACACCACTC 

Humna2 TGTCAGTGAC GATGACAGCG GGACCTACAC GTGTGTGGCC AACACCACTC 

Chick TGTAACTGAT AAAGATGATG GAACATATAC TTGCATAGTT AATACTACTC 

Humanl TGGACAGCGT CTCCGCCAGC GCTGTGCTTA GCGTTGTTGC TCCTACTCCA 

Human2 TGGACAGCGT CTCCGCCAGC GCTGTGCTTA GCGTTGTTGC TCCTACTCCA 

Chick TGGACAGTGT TTCAGCAAGT GCTGTGCTTA CTGTTGTTGC TGCTCCCCCA 



Human2 
Chick 



Humanl 
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